Zhiqiang Xie

Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving

May 06, 2025

Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks

May 02, 2025

AI Metropolis: Scaling Large Language Model-based Multi-Agent Simulation with Out-of-order Execution

Nov 05, 2024

RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)

Sep 04, 2024

Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight

Jul 11, 2024

CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents

Jul 01, 2024

Blockchain-enabled Trustworthy Federated Unlearning

Jan 29, 2024

Efficiently Programming Large Language Models using SGLang

Dec 12, 2023

High-throughput Generative Inference of Large Language Models with a Single GPU

Mar 13, 2023

Dual-side Sparse Tensor Core

May 20, 2021